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The  continued  advancement  of  science  depends  on  shared  and  reproducible  data.  In  the  field  of 
computational  materials  science  and  rational  materials  design  this  entails  the  construction  of  large 
open  databases  of  materials  properties.  To  this  end,  an  Application  Program  Interface  (API)  follow¬ 
ing  REST  principles  is  introduced  for  the  AFLOWLIB.org  materials  data  repositories  consortium. 
AUIDs  (Aflowlib  Unique  IDentifier)  and  AURLs  (Aflowlib  Uniform  Resource  locator)  are  assigned 
to  the  database  resources  according  to  a  well-defined  protocol  described  herein,  which  enables  the 
client  to  access,  through  appropriate  queries,  the  desired  data  for  post-processing.  This  introduces 
a  new  level  of  openness  into  the  AFLOWLIB  repository,  allowing  the  community  to  construct 
high-level  work-flows  and  tools  exploiting  its  rich  data  set  of  calculated  structural,  thermodynamic, 
and  electronic  properties.  Furthermore,  federating  these  tools  would  open  the  door  to  collabora¬ 
tive  investigation  of  the  data  by  an  unprecedented  extended  community  of  users  to  accelerate  the 
advancement  of  computational  materials  design  and  development. 


I.  INTRODUCTION 

Data-driven  materials  science  has  gained  considerable 
traction  over  the  last  decade  or  so.  This  is  due  to  the  con¬ 
fluence  of  three  key  factors:  1)  Improved  computational 
methods  and  tools;  2)  Greater  computational  power; 
and  3)  Heightened  awareness  of  the  power  of  extensive 
databases  in  science  [1].  The  recent  Materials  Genome 
Initiative  (MGI)  [1,  2]  reflects  the  recognition  that  many 
important  social  and  economic  challenges  of  the  21st  cen¬ 
tury  could  be  solved  or  mitigated  by  advanced  materials. 
Computational  materials  science  currently  presents  the 
most  promising  path  to  the  resolution  of  these  challenges. 

The  first  and  second  factors  above  are  epitomized  by 
high-throughput  computation  of  materials  properties  by 
ab  initio  methods,  which  is  the  foundation  of  an  effective 
approach  to  materials  design  and  discovery  [3-12].  Re¬ 
cently,  the  software  used  to  manage  both  the  calculation 
work-flow  and  perform  the  analyses  have  trended  toward 
more  public  and  user-friendly  frameworks.  The  empha¬ 
sis  is  increasingly  on  portability  and  sharing  of  tools  and 
data  [13-15].  The  future  advance  of  computational  mate¬ 
rials  science  would  rely  on  interoperable  and  federatable 
tools  and  databases  as  much  as  on  the  quantities  and 
types  of  data  being  produced. 

A  principle  of  high-throughput  materials  science  is  that 
one  does  not  know  a  priori  where  the  value  of  the  data 
lies  for  any  specific  application.  Trends  and  insights  are 
deduced  a  posteriori.  This  requires  efficient  interfaces  to 
interrogate  available  data  on  various  levels.  We  have  de¬ 
veloped  a  simple  WEB-based  API  to  greatly  improve  the 


accessibility  and  utility  of  the  AFLOWLIB  database  [14] 
to  the  scientific  community.  Through  it,  the  client  can 
access  calculated  physical  properties  (thermodynamic, 
crystallographic,  or  mechanical  properties),  as  well  as 
simulation  provenance  and  runtime  properties  of  the  in¬ 
cluded  systems.  The  data  may  be  used  directly  (e.g.,  to 
browse  a  class  of  materials  with  a  desired  property)  or  in¬ 
tegrated  into  higher  level  work- flows.  The  interface  also 
allows  for  the  sharing  of  updates  of  data  used  in  previous 
published  works,  e.g.,  previously  calculated  alloy  phase 
diagrams  [16-28],  thus  the  database  can  be  expanded  sys¬ 
tematically. 

The  rest  of  this  paper  is  organized  as  follows.  The 
AFLOWLIB  libraries  are  presented  in  Section  II.  A  new 
materials  identifier  constructed  to  navigate  the  libraries 
is  introduced  in  Section  III.  The  data  provenance  and 
access  format  schemes  are  explained  in  Sections  IV  and 
V.  The  access  syntax  and  its  various  options  are  de¬ 
scribed  in  Section  VI.  A  few  examples  for  the  use  of  the 
API  and  two  computer  scripts  are  given  in  Section  VII. 
The  strategy  for  updates  is  mentioned  in  Section  VIII. 
A  brief  conclusion  is  included  in  Section  IX. 


II.  THE  AFLOWLIB  LIBRARIES 
MULTI-LAYERED  STRUCTURE 

At  its  core,  AFLOWLIB  consists  of  a  coordinated  set 
of  libraries  of  first-principles  data  describing  thermody¬ 
namic,  structural,  and  other  materials  properties  of  al¬ 
loy  systems  (Figure  1).  From  the  top,  it  is  adminis- 
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FIG.  1:  Schematic  structure  of  the  AFLOWLIB  consortium 


tered  through  a  large  SQL  database  [29]  organized  in 
layers  (reminiscent  of  other  more  developed  computer 
interfaces,  i.e.,  IEEE-POSIX  [30]).  Each  layer  consists 
of  searchable  entries  called  af  lowlib .  out.  The  SQL  in¬ 
terface  is  called  the  zero-layer  (Figure  2)  because  of  its 
structural  immanence. 

In  this  layered  organization,  each  af  lowlib.  out  can 
be  the  child  of  an  af  lowlib .  out-parent  or  the  parent  of 


an  af  lowlib .  out-child.  Each  af  lowlib.  out  is  identi¬ 
fied  by  a  name  and  an  address.  The  first  is  the  AUID 
(Aflowlib  Unique  IDentifier  -  $auid),  while  the  second  is 
the  AURL  (Aflowlib  Uniform  Resource  Locator  -  $aurl). 
The  structure  is  summarized  in  Figure  1. 

The  current  implementation  of  AFLOWLIB  includes 
three  layers  that  can  be  navigated  using  control  keywords 
and  absolute  paths. 


•  Project-layer.  This  layer  contains  information  about  the  project  to  which  the  data  belongs.  For  example, 
for  searches  in  alloys,  the  project’s  AURL  could  be  of  the  type  server  :AFL0WDATA/  followed  by  LIB1_RAW, 
LIB1_LIB,  LIB2_RAW,  LIB2_LIB,  LIB3_RAW,  or  LIB3_LIB  corresponding  to  the  single  element,  binary  and  ternary 
libraries  of  post-  or  pre-processed  data  respectively.  For  HTTP  access  [31],  the  AURL  would  be  translated  from 
$aurl=server :  /directory/  into  $web=http : //server/directory/.  Other  translations  are  considered  for 
future  implementations.  An  example  of  aflowlib. out  for  the  project-layer  is  depicted  in  Figure  3(a)  (with 
server=af lowlib . duke . edu). 

•  Set-layer.  This  layer  contains  information  about  one  or  more  systems  calculated  in  one  or  more  different 
configurations  (e.g.,  various  structural  prototypes,  different  unit  cells  required  for  phonons  calculations  using 
the  finite  difference  method,  etc.).  An  example  of  aflowlib. out  for  the  set-layer  is  depicted  in  Figure  3(b). 
To  facilitate  reproducibility,  species  making  up  the  $aurl  might  include  a  subscript  or  postfix  indicating  the 
pseudopotential  type.  For  example,  in  searches  in  this  layer  where  the  user  is  interested  in  the  Ag-Ti  system,  the 
AURL  could  be  of  the  type  $aurl=server :  AFL0WDATA/LIB2_RAW/AgTi_sv/.  Here  the  “_sv”  in  Ti_sv  indicates 
the  “sv”  type  of  pseudopotential  in  the  quantum  code  used  for  the  calculation,  in  this  case  VASP  [32].  Other 
identifiers  may  be  used  to  indicate  pseudopotentials  used  in,  for  example,  Quantum  Espresso  (QE)  [33]  or  for 
potentials  coming  from  other  ultrasoft  pseudopotential  libraries  [34] . 

•  Calculation-layer.  This  layer  contains  information  about  one  system  calculated  in  one  particular  configuration 
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(e.g.  AgTi  in  configuration  $prototype=66  (Cll*,  [35,  36])).  For  entries  in  this  layer,  the  AURL  could  be  of  the 
type  $aurl=server : AFL0WDATA/LIB2_RAW/AgTi_sv/66/.  This  $aurl  points  to  an  aflowlib.out  describing 
this  specific  structure  properties.  An  example  of  aflowlib.out  for  the  calculation-layer  is  depicted  in  Figure 
3(c).  The  calculated  geometry  (here  “66”)  is  one  of  the  strings  comprising  the  AFLOW  prototypes’  database. 
The  prototypes’  database  acts  as  a  look-up  table  to  common  Strukturbericht  designations  [35,  36]  and  may  also 
refer  to  geometries  included  in  the  Inorganic  Crystal  Structure  Database  (ICSD)  [37-39].  In  the  latter  case,  a 
legitimate  entry  would  be  something  like  .  .  . /AgTi_sv/ICSD_58369 .  AB/,  where  the  structure  composed  of  Ag 
and  Ti  is  calculated  in  the  configuration  defined  by  $prototype=ICSD_58369,  and  the  postfix  .AB  indicates  the 
species  ordering:  e.g.  .AB  or  .BA  for  Ag  and  Ti  in  AB  or  BA  positions,  respectively.  For  ternaries  the  postfix 
is  a  combination  of  A,B,C,  where  indices  can  be  repeated  to  create  binaries  (e.g.  AAB).  Furthermore,  the  non¬ 
element  X  can  be  included  to  indicate  vacancies  in  sublattices  (e.g.  from  Heusler  to  half-Heusler  systems  [40]). 
Valid  $prototype  choices  also  include  strings  of  the  type  “f  123”  following  the  enumeration  scheme  of  Hart  and 
Forcade  [41,  42].  Here  the  characters  f ,  b,  h,  s  indicate  fee,  bcc,  hep,  and  simple  cubic  structures  respectively. 
The  number  that  follows  designates  the  enumerated  prototype.  The  complete  list  of  structure  designations  can 
be  accessed  with  the  command  “aflow  — protos”  or  by  consulting  the  online  links.  The  options  are  illustrated  in 
the  AFLOW  manual  [13]. 


It  is  important  to  note  that  while  the  operations  of  con¬ 
catenating  strings  to  $aurl  are  reminiscent  of  a  UNIX 
computer  file-system,  the  final  $aurl  is  not  necessarily 
served  as  a  directory  or  a  file.  In  fact,  a  computer  dae¬ 
mon  (process  running  in  background)  dynamically  serves 
multiple  file-system  directories  for  the  same  HTTP- 
translated  $aurl.  The  implementation  was  so  chosen 
in  order  to  better  use  the  calculation-layer  data  belong¬ 
ing  to  different  project-layers  and  to  allow  an  $aurl  to 
contain  multiple  servers:  e.g.  $aurl=serverl ,  server2 : 
somewherel/AgTi_sv/ , somewhere2/AgTi_sv/.  In  fu¬ 
ture  updates  of  the  AFLOWLIB.org  API,  the  latter  col¬ 
lection  will  allow  users  to  download  AgTi’s  data  from 
serverl  and  server 2  concurrently  and  transparently. 


III.  MATERIALS  OBJECT  IDENTIFIER: 

AFLOWLIB  UNIQUE  IDENTIFIER 

Following  the  spirit  of  the  Digital  Object  Identifier 
(DOI)  [43]  every  entry  in  our  database  is  identified  by 
an  alpha-numeric  string  called  an  AUID  (Aflowlib  Unique 
IDentifier  -  $auid).  In  future  publications,  we  will  use 
the  AUID  as  a  permanent  string  to  the  correct  AURL, 
which  might  get  relocated  across  servers.  By  giving  a 
short  AUID,  the  user  can  identify  1)  WEB  locations  for 
the  deliverables,  2)  appropriate  points  of  contact  (name, 
emails  or  other  means  of  identifying  corresponding  au¬ 
thors),  and  3)  copyright  and  publication  date.  We  have 
chosen  this  linked  approach  so  that  future  databases  and 
expansions  of  the  current  ones  (e.g.,  locations,  servers, 
personnel,  etc.)  will  not  affect  the  retrieval  of  the  ob¬ 
jects.  A  simple  WEB-FORM  will  be  offered  to  the  com¬ 
munity  to  insert  AUIDs  and  to  retrieve  the  informa¬ 
tion.  The  AIUD  is  constructed  from  a  64-bit  CRC  check¬ 
sum  (cyclic  redundancy  check  [44])  of  concatenated  in¬ 
put  and  output  files.  For  example,  the  bcc  Ag-Ti  struc¬ 
ture  66  described  above  would  be  accessed  using  the 
AUID  “$auid=af low:448e!78el9e9e973”.  The  negligi¬ 


ble  probability  of  duplicates  with  this  checksum  length, 
and  the  opportunity  of  adding  a  “void  character”  to  the 
input  files  (a  grain  of  salt)  guarantee  the  uniqueness  of 
the  AUID  and  its  expandibility  for  many  years  to  come. 
We  welcome  other  groups  to  set  up  versions  of  AUIDs 
and  to  communicate  with  us  the  necessary  identification 
data  so  that  our  consortium  can  offer  WEB  links  to  en¬ 
tries  beyond  the  ones  prepared  by  its  current  members. 


IV.  DATA  PROVENANCE 

Reproducibility  is  a  key  tenet  of  the  scientific  method, 
but  replication  occurs  rarely.  As  experiments  become 
more  complex  and  costly  this  problem  will  only  worsen. 
A  particular  advantage  of  simulated  experiments  is  that 
they  are  uniquely  suited  for  reproducibility.  Imagine 
a  scenario  in  which  the  original  experimenter  publishes 
his/her  executable  workflow  with  the  necessary  input 
data.  Then,  anyone  wishing  to  validate  or  build  upon 
those  results  does  so  with  a  single  command  on  a  sys¬ 
tem  with  access  to  the  appropriate  software.  This  could 
be  done  so  that  all  the  software  tools,  input  and  output 
data  are  maintained  remotely,  lowering  cost,  improving 
ecological  sustainability  (saving  electricity)  and  increas¬ 
ing  collaboration. 

In  reality,  such  a  framework  is  not  yet  available  and 
its  implementation  is  not  so  straightforward.  Despite 
the  potential  advantages  of  computational  science  in  the 
realm  of  reproducibility,  the  crucial  data  provenance  is 
often  completely  neglected.  An  overall  culturally-driven 
disincentive  to  the  actual  reproduction  of  experiments 
is  pervasive.  Fortunately  there  are  indications  that  this 
problem  is  being  recognized  and  addressed  [2]. 

AFLOWLIB  data  is  reproducibly  structured — the 
workflow  and  input  parameters  are  defined  by  the 
AFLOW  software  and  a  single  master  input  file  aflow.  in, 
leading  to  easily  reproducible  input  parameters.  In  the 
case  of  Density  Functional  Theory-based  simulations, 
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Search  Aflowlib  (568388  Calculations  2014/02/13) 


Element(s) 
Species  number 


Usage:  &(and),  | (or). 


~(not),  'Rxor),  m(metal)  e.g.  ~Si  and  Al:  having  Al  but  not  Si 


Material  Type  |  any  type _ sj  Lattice  System  |  any  lattice  system  ;  1  Bravais  Lattice  |  any  lattice 


Space  Group  Number 


Pearson  Symbol 


(structure  properties) 


Minimum  band  gap  = 


eV 


Maximum  band  gap  = 


eV 


(electronic  properties) 


Minimum  <Pn>/L  = 
pW/cmK2nm 
Minimum  <Pp>/L  = 
pW/cmK2nm 

Minimum  Magnetic  Moment  = 


Maximum  <Pn>/L  = 
pW/cmK2nm 
Maximum  <Pp>/L  = 
pW/cmK2nm 

Maximum  Magnetic  Moment  = 
pB/atom 

Maximum  AS(EF)  = 


(thermoelectric  properties) 
(thermoelectric  properties) 
(magnetic  properties) 
(magnetic  properties) 


FIG.  2:  Project  WEB-interface  search  form:  zero-layer. 


this  includes  essential  calculation  parameters  such  as  the 
k-mesh  density,  the  energy  cut-off,  the  exchange  cor¬ 
relation  potential,  the  ab  initio  software  used,  and  the 
geometry  of  the  structures.  Combining  the  parameters 
contained  in  this  file  with  other  provenance  related  data 
(calculation  time,  memory,  code,  etc.),  the  AFLOW-API 
provides  curated  and  reproducible  data. 


V.  DATA  FORMATTING,  OWNERSHIP,  AND  A 
FEDERATABLE  FRAMEWORK 

Data  access  can  be  obtained  at  any  level  through 
the  API  with  the  appropriate  AURL  strings,  currently 
translated  into  WEB  inquiries.  WEB  forms  such  as  the 
one  shown  in  Fig  2  allow  the  user  to  search  within  a 
project  for  data  fitting  specified  criteria.  Alternatively, 
access  through  the  API  is  supported  by  several  data  for¬ 
mats:  “HTML”  [31],  “JSON”  [45],  “DUMP” ,  “PHP” 
[46],  “TEXT”,  and  “NONE”. 

“HTML”  (default):  $aurl/?f ormat=html. 

“JSON”  (javascript  syntax):  $aurl/?f ormat=json. 

“DUMP”  (php  constructs):  $aurl/?f ormat=dump. 


“PHP”  (valid  php  syntax):  $aurl/?f  ormat=php. 

“TEXT”  (no  syntax):  $aurl/?f ormat=text. 

“NONE”  (no  output):  $aurl/?f  ormat=none. 

The  format  option  is  intended  for  use  on  the  level  of 
the  entry  returning  the  whole  af lowlib .  out.  For  any 
given  property  in  the  set  of  keywords  contained  in  the 
entry,  the  mode  is  currently  to  return  a  simple  byte  se¬ 
quence  with  no  formatting.  Attempting  to  format  a  sin¬ 
gle  property  returns  the  full  property  set  in  the  speci¬ 
fied  format.  For  example,  the  AURL  $aurl/?density  re¬ 
turns  the  density  of  the  specified  structure.  More  specifi¬ 
cally,  for  $aurl=af lowlib. duke . edu: AFL0WDATA/LIB2_ 
RAW/AgTi_sv/66/?density  it  returns  “6.54346”.  How¬ 
ever,  . . . /AgTi_sv/66/?density&f ormat=html  is  equiv¬ 
alent  to  .  .  . /AgTi_sv/66/?f  ormat=html,  returning  the 
full  set  of  entry  properties.  “HTML”  is  primarily  for  in¬ 
teractive  use  where  keywords  and  files  can  be  promptly 
explored  with  web  browsers;  “JSON”  and  “PHP”  are 
valid  language  syntaxes  to  facilitate  the  data  access  by 
programmed  codes;  “DUMP”  allows  the  user  to  access 
in  his/her  own  method;  “TEXT”  returns  the  entry  in  a 
single  line  with  the  keywords  separated  by  “I”  so  that 
other  databases  can  be  built  on  top  of  AFLOWLIB.org; 
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aurl=af lowlib . duke . edu : AFL0WDATA/LIB3_RAW 
auid=af low:LIB_RAW3 
aapi=l . 0 

keywords=aurl , auid, af lowlib_entries_number , af lowlib_entries , keywords , aapi 
af lowl ib_entr ies_number=2  6333 
af lowl ib_entr ies=AgAlAs , AgAlAu , AgAlB_h , AgAlBa_sv , AgAlBe_sv , AgAlBi_d , AgAlBr , AgAlCa_s v , AgAlCd , 

AlHg, AgAl In_d , AgAlIr , AgAlK_sv , AgAlLa , AgAlLi_sv , AgAlMg_pv , AgAlMn_pv , AgAlMo_pv , 
AgAlRu_pv , AgAlSb , AgAlSc_s v , AgAlSe , AgAl Si , AgAlSn , AgAlSr_s v , AgAlTa_pv , AgAlTc_pv , 
AgAsBe_s v , AgAsBi_d , AgAsBr , AgAsCa_sv , AgAsCd , AgAsCl , AgAsCo , AgAsCr_pv , AgAsCu_pv , 
AgAsMg_pv , AgAsMn_pv , AgAsMo_pv, AgAsNa_sv, AgAsNb_sv, AgAsNi_pv, AgAsOs_pv, AgAsP , . . . 


Example  of  af  lowlib .  out  for  the  “project-layer” 


b) 


aurl=af lowlib . duke . edu : AFL0WDATA/LIB3_RAW/AgCoMn_pv 
auid=af low: AgCoMn_pv: PAW_PBE 
aapi=l . 0 

keywords=aurl , auid, af lowl ib_entries_number , af lowlib_entries , keywords , aapi 
af lowlib_entries_number=9 

af lowlib_entries=T000 1 . A2BC, T000 1 . AB2C , TOO 01 . ABC2 , TO 00 2 . A2BC, T0002 . AB2C , TO 00 2 .ABC 2 , . . 


Example  of  af  lowlib .  out  for  the  “set-layer” 


C) 


aurl=af lowlib. duke. edu :AFLOWDATA/LIB3_RAW/AgCoMn_pv/TO 002 . A2BC 
auid=af low: 36a7b730e762fddf 
aapi=l . 0 

keywords=aurl , auid, af low_api_version, code, compound, prototype, nspecies , . . . 

af lowlib_date=20140130_20 : 34 : 00_GMT-5 

af lowlib_version=30794 

af low_version=af low30293 

calculation_cores=l 

calculation_memory=539 

calculation_time=18347 . 2 

corresponding=Stef ano_Sanvito_sanvitos@tcd. ie 
loop=thermodynamics , bands , magnetic 
node_CPU_Cores=12 
node_CPU_MH z = 2  6  6 1 

node_CPU_Model=Intel ( R ) _Xeon ( R ) _CPU_X5  6  5  0_@_2 . 6  7GHz 

node_RAM_GB=2 4 

code=vasp .4.6.35 

composition=2 ,1,1 

compound=Ag2ColMnl 

density=8 .94193 

eentropy_cell=0 

e  e  n t  r opy_a t  om=  0 

Egap=0 

energy_cell=-20 . 4051 
energy _atom=- 5.10128 
enthalpy _cell=-20 . 4051 
enthalpy_atom=-5 .10128 
enthalpy_formation_cell=l . 51248 
enthalpy_formation_atom=0 .378121 
entropic_temperature=-4220 .27 

f iles=AgCoMn_pv. T0002 .A2BC.cif ,AgCoMn_pv.T0002 .A2BC.png, . . 
f orces=0 ,0,0;0,0,0;0,0,0;0,0,0 
geometry=4 .42361,4.42361,4.42361,60,60,60 


Example  of  af  lowlib .  out  for  the  “calculation-layer” 


lattice_system_orig=cubic 

lattice_system_relax=cubic 

lattice_variation_orig=FCC 

lattice_variation_relax=FCC 

natoms=4 

nbondxx=l. 09 11, 1.09 11, 1.09 11, 1.78 18, 1.0911, 1.7818 
nspecies=3 

Pearson_symbol_orig=cF16 

Pearson_symbol_relax=cF16 

positions_cartesian=0,0,0;4.691,4.691,4.691;3.127,3. 127,3.127; 1.563. . 

df t_type=PAW_PBE 

pressure=0 

prototype=T0  0 0  2 . A2  BC 

PV_cell=0 

PV_atom=0 

sg=F-43m#2 16 , F-43m#2 16 , F-43m#2 16 
sg2=F-43m#2 16 , F-43m#2 16 , F-43m#2 16 
spacegroup_orig=216 
spacegroup_relax=2 16 
species=Ag,Co,Mn 
specie  s_pp= Ag , Co , Mn_p v 

species_pp_version=Ag: 06Sep2000 ,Co: 06Sep2000 ,Mn_pv: 07Sep2 000 

spin_cell=5 .17619 

spin_atom=l .29405 

spinD=0. 035, -0.0 10, 1.498,3.635 

spinF=0 .618302 

stoichiometry=0 .5,0.25,0.25 

Egap_type=metal 

valence_cell_iupac=20 

volume_cell=61 .209 

volume_atom=15 .3023 


FIG.  3:  Examples  of  the  contents  defining  “aflowlib.out”  entries  for  the  project-layer  (panel  a),  for  the  set-layer  (panel  b),  and 
for  the  calculation-layer  (panel  c).  As  per  AFLOWLIB  REST- API  version  1.0  (this  document),  mandatory  keywords  are  listed 
in  red,  optional  control  keywords  are  in  green,  and  optional  materials  keywords  are  in  blue. 


and  “NONE”  may  be  useful  as  a  method  to  test  the  ex¬ 
istence  of  an  AURL  or  for  debug  purposes. 

Clear  attribution  of  contributed  data  is  essential  for  the 
development  of  distributed  databases  comprising  inputs 
from  a  wide  network  of  contributors.  AFLOW  facilitates 
attribution  with  the  AUID,  a  unique  and  persistent  iden¬ 
tifier,  that  includes  the  author,  laboratory,  group,  and 
affiliation  as  data  entry  fields.  The  shared  content  in  the 
database  is  simple  to  reduce  or  augment  according  to  a 
contributor’s  preference  and  the  attribution  is  ensured 
by  the  unique  identifier  and  contributor  labels  that  are 
accessible  with  the  AFLOWLIB- API. 

The  structure  of  the  AFLOWLIB  database  is  federated: 
Autonomous  members  of  the  consortium  (with  distinct 


geographical  locations  and  affiliations)  are  able  to  trans¬ 
parently  contribute  to  a  composite  database,  preserving 
ownership  and  claim  over  the  substance  of  their  data. 
The  underlying  meta-data  schema  of  the  contributed 
data  are  consistent  by  production,  to  ensure  the  clarity 
and  searchability  of  the  composite  database.  A  contrib¬ 
utor  to  the  consortium  begins  by  downloading  the  latest 
version  of  the  AFLOW  binary  (as  of  writing  this  paper 
this  is  version  30805)  and  interfacing  with  a  quantum 
code.  AFLOW  is  currently  configured  to  run  VASP  auto¬ 
matically.  Pre-  and  post-processing  is  functional  for  both 
VASP  and  Quantum  Espresso  so  that  agnostic  standard¬ 
ization  of  inputs  and  outputs  between  the  two  codes  can 
be  obtained. 
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VI.  TABLE  OF  PROPERTIES  AND  API  KEYWORDS 

This  section  includes  the  keywords  currently  present  in  the  database:  description,  type,  inclusion  policy  and  the 
AFLOWLIB  syntax  for  retrieval.  The  list  is  divided  into  mandatory,  optional  control  and  optional  materials  keywords. 
The  mandatory  keywords  must  be  present  in  every  entry  at  all  layers  of  the  database.  Some  of  the  optional  control 
keywords  appear  at  the  projects  and  systems  levels  while  others  appear  at  the  calculations  level.  The  optional 
materials  keywords  usually  appear  just  at  the  calculations  layer ,  and  not  all  of  them  are  present  in  all  of  the  entries. 
Each  entry  begins  with  the  AURL  and  AUID  keywords,  denoted  by  the  syntax  words  $aurl  and  $auid,  respectively. 


A.  Mandatory  keywords 

•  /  or  /?f ormat=html,  json,  dump,  php,  or  text. 

—  Description.  The  whole  entry.  It  can  be  read  in  different  formats.  For  the  calculation-layer  it  can  also  be 
read  as  /?af lowlib . out. 

—  Type,  lines  of  strings. 

—  Example.  See  examples  of  entries  at  different  levels  in  Figure  3. 

—  Request  syntax.  $aurl. 

•  auid 


—  Description.  “AFLOWLIB  Unique  Identifier”  for  the  entry,  AUID,  which  can  be  used  as  a  publishable 
object  identifier,  following  the  spirit  of  the  DOI  foundation  [43]  (see  Section  III). 

—  Type,  string. 

—  Example.  auid=af low: e9c6d914c4b8d9ca. 

—  Request  syntax.  $aurl/?auid. 


•  aurl 


—  Description.  “AFLOWLIB  Uniform  Resource  Locator”  returns  the  AURL  of  the  entry.  The  web  server  is 
separated  from  the  web  directory  with  “:” .  This  tautological  keyword,  aurl  returning  itself,  is  useful  for 
debug  and  hyper  linking  purposes. 

—  Type,  string. 

—  Example.  aurl=af lowlib . duke . edu : AFL0WDATA/LIB3_RAW/Bi_dRh_pvTi_sv/T0003 . ABC : LDAU2. 

—  Request  syntax.  $aurl/?aurl. 


•  aapi 

—  Description.  “AFLOWLIB”  version  of  the  entry,  API.  This  article  describes  version  1.0  of  the  REST- API. 

—  Type,  string. 

—  Example.  aapi=1.0. 

—  Request  syntax.  $  aurl/?  aapi. 

•  keywords 

—  Description.  This  includes  the  list  of  keywords  available  in  the  entry,  separated  by  commas.  All  of  the 
keywords  can  be  requested  to  the  database.  The  request  keywords  should  be  the  first  one  made,  so  that 
the  reader  is  made  aware  of  the  available  keywords. 

—  Type.  List  of  strings  separated  by  “,”. 

—  Example.  keywords=aurl , auid, loop, code, compound, prototype ,nspecies ,natoms , .... 

—  Request  syntax.  $aurl/?keywords. 
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B.  Optional  controls  keywords  (alphabetic  order) 

•  af lowlib_entries  (af lowlib_entries_number) 

—  Description.  For  projects  and  set-layer  entries  (see  Figure  1),  af  lowlib_entries  lists  the  available  sub¬ 
entries  which  are  associated  with  the  $aurl  of  the  subdirectories.  By  parsing  $aurl/?af  lowlib_entries 
(containing  $aurl/af  lowlib_entries_number  entries)  the  user  finds  the  further  locations  to  interrogate. 

—  Type.  Set  of  strings  separated  by  (number). 

—  Example.  aflowlib_entries=AgAl,AgAs,AgAu,AgB_h,AgBa_sv,AgBe_sv,AgBi_d,AgBr ,AgCa_sv, . . . 
(af lowlib_entries_number=1524). 

—  Request  syntax.  $aurl/?af lowlib_entries  ($aurl/af lowlib_entries_number). 

•  af lowlib_date  (af lowlib_version) 

—  Description.  Returns  the  date  (version)  of  the  AFLOW  post-processor  which  generated  the  entry  for  the 
library.  This  entry  is  useful  for  debugging  and  regression  purposes. 

—  Type,  string. 

—  Example,  af lowlib_date=20140204_13 : 10 : 39_GMT-5  (af lowlib_version=30794). 

—  Request  syntax.  $aurl/?af lowlib_date  ($aurl/?af lowlib_version). 

•  af low_version 

—  Description.  Returns  the  version  number  of  AFLOW  used  to  perform  the  calculation.  This  entry  is  useful 
for  debugging  and  regression  purposes. 

—  Type,  string. 

—  Example,  af low_version=af low30641. 

—  Request  syntax.  $aurl/?af  low_version. 

•  author 

—  Description.  Returns  the  name  (not  necessarily  an  individual)  and  affiliation  associated  with  authorship  of 
the  data.  Multiple  entries  are  separated  by  commas.  Spaces  are  substituted  with  to  aid  parsing. 

—  Type.  List  of  strings  separated  by  * 

—  Example.  author=Marco_Buongiorno_Nardelli , 0had_Levy , Jesus_Carrete. 

—  Request  syntax.  $aurl/?author. 

•  calculation_cores,  calculation_memory,  calculation_time 

—  Description.  Number  of  processors/cores,  maximum  memory,  total  time  used  for  the  calculation. 

—  Type,  number,  number,  number. 

—  Units,  adimensional,  Megabytes,  seconds. 

—  Example.  calculation_cores=32,  calculation_memory=8376 . 13,  calculation_time=140713. 

—  Request  syntax.  $aurl/?calculation_cores,  $aurl/?calculation_memory,  $aurl/?calculation_time. 

•  corresponding 

—  Description.  Returns  the  name  (not  necessarily  an  individual)  and  affiliation  associated  with  the  data  origin 
concerning  correspondence  about  data.  Multiple  entries  are  separated  by  commas.  Spaces  are  substituted 
with  to  aid  parsing. 

—  Type.  List  of  strings  separated  by 

—  Example.  corresponding=M_Buongiorno_Nardelli_mbn@unt . edu. 

—  Request  syntax.  $aurl/?corresponding. 

•  loop 

—  Description.  Informs  the  user  of  the  type  of  post-processing  that  was  performed. 


—  Type.  List  of  strings  separated  by 

—  Example.  loop=thermodynamics , bands , magnetic. 

—  Request  syntax.  $aurl/?loop. 

node_CPU_Cores ,  node_CPU_MHz,  node_CPU_Model,  node_RAM_GB 

—  Description.  Information  about  the  node/cluster  where  the  calculation  was  performed.  Number  of  cores, 
speed,  model,  and  total  memory  accessible  to  the  calculation. 

—  Type,  number,  number,  string,  number. 

—  Units.  MHz  for  speed,  gigabytes  for  RAM. 

—  Example.  node_CPU_Cores=12,  node_CPU_MHz=2661,  node_RAM_GB=48, 
node_CPU_Model=Intel (R) _Xeon (R) _CPU_X5650_@_2 . 67GHz. 

—  Request  syntax.  $aurl/?node_CPU_Cores,  $aurl/?node_CPU_MHz, 

$aurl/?node_CPU_Model,  $aurl/?node_RAM_GB. 

sponsor 

—  Description.  Returns  information  about  funding  agencies  and  other  sponsors  for  the  data.  Multiple  entries 
are  separated  by  commas.  Spaces  are  substituted  with  to  aid  parsing. 

—  Type.  List  of  strings  separated  by  • 

—  Example.  sponsor=D0D_N000141310635 ,NIST_70NANB12H163. 

—  Request  syntax.  $aurl/?sponsor. 

C.  Optional  materials  keywords  (alphabetic  order) 

Bravais_lattice_orig  (Bravais_lattice_relax) 

—  Description.  Returns  the  Bravais  lattice  [47]  of  the  original  unrelaxed  (relaxed)  structure  before  (after)  the 
calculation. 

—  Type,  string. 

—  Example.  Bravais_lattice_orig=MCLC  (Bravais_lattice_relax=MCLC). 

—  Request  syntax.  $aurl/?Bravais_lattice_orig  ($aurl/?Bravais_lattice_relax). 

code 

—  Description.  Returns  the  software  name  and  version  used  to  perform  the  simulation. 

—  Type,  string. 

—  Example.  code=vasp.4.6.35. 

—  Request  syntax.  $aurl/?code. 

composition 

—  Description.  Returns  a  comma  delimited  composition  description  of  the  structure  entry  in  the  calculated 

cell. 

—  Type.  List  of  number  separated  by 

—  Example.  composition=2,6,6.  (For  a  A2B6C6  compound). 

—  Request  syntax.  $aurl/?composition. 

compound 

—  Description.  Similar  to  composition.  Returns  the  composition  description  of  the  compound  in  the  calcu¬ 
lated  cell. 

—  Type.  Set  of  {string-number}. 

—  Example.  compound=Co2Er6Si6. 
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—  Request  syntax.  $aurl/? compound. 

•  density 

—  Description.  Returns  the  mass  density. 

—  Type,  number. 

—  Units,  grams/cm3. 

—  Example.  density=7 . 76665. 

—  Request  syntax.  $aurl/?density. 

•  dft_type 

—  Description.  Returns  information  about  the  pseudopotential  type,  the  exchange  correlation  functional  used 
(normal  or  hybrid)  and  use  of  GW. 

—  Type.  Set  of  strings  separated  by  “,” . 

—  Example.  If  the  calculations  were  performed  with  VASP  [32],  the  entry  could  include  “US”,  “GGA”, 
“PAWJLDA” ,  “PAW_GGA”,  “PAW_PBE”  ,  “GW”,  “HSE06”  (February  2014). 

—  Example.  dft_type=PAW_PBE,HSE06. 

—  Request  syntax.  $aurl/?dft_type. 

•  eentropy_cell  (eentropy_atom) 

—  Description.  Returns  the  electronic  entropy  of  the  unit  cell  used  to  converge  the  ab  initio  calculation 
(smearing). 

—  Type,  number. 

—  Units.  Natural  units  of  the  $code,  e.g.,  eV  or  Ry  (eV/atom  or  Ry/atom)  if  the  calculations  were  performed 
with  VASP  [32]  or  QE  [33] ,  respectively. 

—  Example.  eentropy_cell=0 . 0011  (eentropy_atom=0 . 0003). 

—  Request  syntax.  $aurl/?eentropy_cell  ($aurl/?eentropy_atom). 

•  Egap 

—  Description.  Band  gap  calculated  with  the  approximations  and  pseudopotentials  described  by  other  key¬ 
words. 

—  Type,  number. 

—  Units.  eV. 

—  Example.  Egap=2.5. 

—  Request  syntax.  $aurl/?Egap. 

•  Egap_fit 

—  Description.  Simple  cross-validated  correction  (fit)  of  Egap.  See  Ref.  [9]  for  the  definition. 

—  Type,  number. 

—  Units.  eV. 

—  Example.  Egap_f  it=3 . 5. 

—  Request  syntax.  $aurl/?Egap_f  it. 

•  Egap.type 

—  Description.  Given  a  band  gap,  this  keyword  describes  if  the  system  is  a  metal,  a  semi-metal,  an  insulator 
with  direct  or  indirect  band  gap. 

—  Type,  string. 

—  Example.  Egap_type=insulator_direct. 

—  Request  syntax.  $aurl/?Egap_type 
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•  energy_cell  (energy. atom) 

—  Description.  Returns  the  total  ab  initio  energy  of  the  unit  cell  E  (energy  per  atom  — the  value  of 
energy_cell/7V) . 

—  Type,  number. 

—  Units.  Natural  units  of  the  $code,  e.g.,  eV  or  Ry  (eV/atom  or  Ry/atom)  if  the  calculations  were  performed 
with  VASP  [32]  or  QE  [33] ,  respectively. 

—  Example.  energy_cell=-82 . 1656  (energy_atom=-5 . 13535). 

—  Request  syntax.  $aurl/?energy_cell  ($aurl/?energy_atom). 

•  energy.cutof f 

—  Description.  Set  of  energy  cut-offs  used  during  the  various  steps  of  the  calculations. 

—  Type.  Set  of  strings  separated  by 

—  Units.  Natural  units  of  the  $code,  e.g.,  eV  or  Ry  (eV/atom  or  Ry/atom)  if  the  calculations  were  performed 
with  VASP  [32]  or  QE  [33] ,  respectively. 

—  Example,  energy.cutof f =384. 1,384. 1,384. 1. 

—  Request  syntax.  $aurl/?energy_cutof  f . 

•  enthalpy.cell  (enthalpy.atom) 

—  Description.  Returns  the  enthalpy  of  the  system  of  the  unit  cell  H  =  E  +  PV  (enthalpy  per  atom  — the 
value  of  enthalpy_cell/7V). 

—  Type,  number. 

—  Units.  Natural  units  of  the  $code,  e.g.,  eV  or  Ry  (eV/atom  or  Ry/atom)  if  the  calculations  were  performed 
with  VASP  [32]  or  QE  [33] ,  respectively. 

—  Example.  enthalpy_cell=-82 . 1656  (enthalpy _atom=-5 . 13535). 

—  Request  syntax.  $aurl/?enthalpy_cell  ($aurl/?enthalphy_atom). 

•  enthalpy.f ormation.cell  (enthalpy.f ormation.atom) 

—  Description.  Returns  the  formation  enthalpy  A Hp  per  unit  cell  (AH p  atomic  per  atom).  For  compounds 
AnaBnb  •  *  *  with  Na  +  Nb  -  -  -  =  N  atoms  per  cell,  this  is  defined  as:  A Hp  =  E(AnaBnb  •••)  — 

[NaE (A) atomic  +  NBE(B)atornic  H - ]  (in  the  .atom  case  with  AxaBXb  •  •  •  and  xA  +  xB  •  •  •  =  1  we  have 

AHp atomic  =  B  (Ax  ABX  B  •  •  •  ) atomic  [% aD (A') atomic  H-  %  B  R  (D)  atomic  H-  '  ’  ’  ])• 

—  Type,  number. 

—  Units.  Natural  units  of  the  $code,  e.g.,  eV  or  Ry  (eV/atom  or  Ry/atom)  if  the  calculations  were  performed 
with  VASP  [32]  or  QE  [33] ,  respectively. 

—  Example,  enthalpy.f ormation_cell=-33 . 1587  (enthalpy.f ormation.atom=-0 . 720841). 

—  Request  syntax.  $aurl/?enthalpy_f ormation.cell  ($aurl/?enthalpy_f ormat ion. atom). 

•  entropic.temperature 

—  Description.  Returns  the  entropic  temperature  as  defined  in  Ref.  [3,  7]  for  the  structure.  The  analysis  of 
formation  enthalpy  is,  by  itself,  insufficient  to  compare  alloy  stability  at  different  concentrations  and  their 
resilience  toward  high-temperature  disorder.  The  formation  enthalpy  represents  the  ordering- strength  of  a 
mixture  AxaBXbCxc  •  •  •  against  decomposition  into  its  pure  constituents  at  the  appropriate  concentrations 
xA ,  xB  xc,  •  •  • .  (A Hp  is  negative  for  compound  forming  systems).  However,  it  does  not  contain  informa¬ 
tion  about  its  resilience  against  disorder,  which  is  captured  by  the  entropy  of  the  system.  To  quantify  this 
resilience  we  define  the  entropic  temperature  for  each  compound  as: 


Ts(AXaBXbCXc 


-={ 


A  Hf(AxaBXbCxc 


kB  [XA  log(^A)  +  XB  log(:rB)  +  xc  log(xc) 


(1) 


where  the  sign  is  chosen  so  that  a  positive  temperature  is  needed  for  competing  against  compound  stability. 
This  definition  assumes  an  ideal  scenario  [3]  where  the  entropy  is  S' [{:£*}]  =  —  sAl  xt  l°g(.x-.().  Ts  is  a 
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concentration-maximized  formation  enthalpy  weighted  by  the  inverse  of  its  entropic  contribution.  Its 
maximum  Ts  =  ma ^phases  [ Ts(phases )]  represents  the  deviation  of  a  system  convex- hull  from  the  purely 
entropic  free-energy  hull,  — TS(pc ),  and  hence  the  ability  of  its  ordered  phases  to  resist  the  temperature- 
driven  deterioration  into  a  disordered  mixture  exclusively  promoted  by  configurational-entropy. 

—  Type,  number. 

—  Units.  Kelvin. 

—  Example.  entropic_temperature=1072 . 1. 

—  Request  syntax.  $aurl/?entropic_temperature. 

•  files 

—  Description.  Provides  access  to  the  input  and  output  files  used  in  the  simulation  (provenance  data). 

—  Type.  List  of  strings  separated  by 

—  Example,  f iles=Bi_dRh_pv . 33 . cif , Bi_dRh_pv . 33 . png , CONTCAR . relax , CONTCAR . relaxl , 

DOSCAR. static .bz2,EIGENVAL. bands .bz2,KP0INTS .bands .bz2,af low. in, edata. bands . out , 
edata . or ig . out , edata . relax . out , . . . 

—  Request  syntax.  $aurl/?f  iles. 

—  Description.  Once  the  “files”  list  has  been  parsed,  each  file  can  be  accessed  with  $aurl/file  (note  no 
“?”  for  accessing  individual  files). 


•  forces 

—  Description.  Final  quantum  mechanical  forces  (Fi,Fj,Fk)  in  the  notation  of  the  code. 

—  Type.  Triplets  (number, number, number)  separated  by  for  each  atom  in  the  unit  cell. 

—  Units.  Natural  units  of  the  $code,  e.g.,  eV/A  or  a.u  if  the  calculations  were  performed  with  VASP  [32]  or 
QE  [33],  respectively. 

—  Example.  forces=0, -0.023928, 0.000197; 0,0. 023928, -0.000197; . . . 

—  Request  syntax.  $aurl/?f  orces. 

•  geometry 

—  Description.  Returns  geometrical  data  describing  the  unit  cell  in  the  usual  a,  6,  c,  <a, /?,  7  notation  (a  = 
{b,  c},/3  =  {c,  a},  7  =  {a,  b}. 

—  Type.  Sixtuplet  (number, number, number, number, number, number). 

—  Units.  a,b,c  are  the  natural  units  of  the  $code,  e.g.,  A  or  a.u.  (Bohr)  if  the  calculations  were  performed 
with  VASP  [32]  or  QE  [33],  respectively.  <a,yd,  7  are  in  degrees. 

—  Example.  geometry=18 .82,18.82,18.82,32.41,32.41,32.41. 

—  Request  syntax.  $aurl/?geometry. 

•  lattice_system_orig,  lattice_variation_orig 
(lattice_system_relax,  lattice_variation_relax) 

—  Description.  Return  the  lattice  system  [47,  48]  and  lattice  variation  (Brillouin  zone)  of  the  original-unrelaxed 
(relaxed)  structure  before  (after)  the  calculation.  See  Ref.  [14,  49]  for  the  lattice  variation  and  Brillouin 
zones  notations. 

—  Type,  string,  string. 

—  Example.  lattice_system_orig=rhombohedral,  lattice_variation_orig=RHLl 
(lattice_system_relax=monoclinic,  lattice_variation_relax=MCLCl). 

—  Request  syntax.  $aurl/?lattice_system_orig,  $aurl/?lattice_variation_orig 
($aurl/?lattice_system_relax,  $aurl/?lattice_variation_relax). 

•  kpoints 

—  Description.  Set  of  k-point  meshes  uniquely  identifying  the  various  steps  of  the  calculations,  e.g.  relaxation, 
static  and  electronic  band  structure  (specifiying  the  k-space  symmetry  points  of  the  structure). 
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—  Type.  Set  of  numbers  and  strings  separated  by  and 

—  Example.  kpoints=10, 10, 10; 16, 16, 16 ; G-X-W-K-G-L-U-W-L-K+U-X. 

—  Request  syntax.  $aurl/?kpoints. 

•  ldau.TLUJ 

—  Description.  This  vector  of  numbers  contains  the  parameters  of  the  “DFT+U”  calculations,  based  on  a 
corrective  functional  inspired  by  the  Hubbard  model  [50,  51].  Standard  values  in  the  AFLOWLIB.org 
library  come  from  Refs.  [9,  10].  There  are  four  fields  (T;{L};{U};{J>),  separated  by  The  first  field 
indicates  the  type  (T)  of  the  DFT+U  corrections:  type=l,  the  rotationally  invariant  version  introduced 
by  Liechtenstein  et  al.  [52];  type=2,  the  simplified  rotationally  invariant  version  introduced  by  Dudarev  et 
al.  [53].  The  second  field  indicates  the  /-quantum  number  ({L>,  one  number  for  each  species  separated  by 
“,”)  for  which  the  on-site  interaction  is  added  (-l=neglected,  0=s,  l=p,  2 =d,  3=/).  The  third  field  lists 
the  effective  on-site  Coulomb  interaction  parameters  ({U},  one  number  for  each  species  separated  by  “,”). 
The  fourth  field  specifies  the  effective  on-site  exchange  interaction  parameters  ({J>,  one  number  for  each 
species  separated  by  “,”).  Although  more  compact,  the  convention  is  similar  to  the  VASP  notation  [32]. 

—  Units,  a-dimensional;  {adimensional};  {eV};  {eV}. 

—  Type,  number; {number,-  •  •  }; {number,-  •  •  }; {number,-  •  •  }  . 

—  Example.  ldau_TLUJ=2;2,0,0;5,0,0;0,0,0. 

—  Request  syntax.  $aurl/?ldau_TLUJ. 

•  natoms 

—  Description.  Returns  the  number  of  atoms  in  the  unit  cell  of  the  structure  entry.  The  number  can  be  non 
integer  if  partial  occupation  is  considered  within  appropriate  approximations. 

—  Type,  number. 

—  Example.  natoms=12. 

—  Request  syntax.  $aurl/?natoms. 

•  nbondxx 

—  Description.  Nearest  neighbors  bond  lengths  of  the  relaxed  structure  per  ordered  set  of  species 

Ai,Aj>i.  For  pure  systems:  {p[AA\};  for  binaries:  {p  [AA\  ,  p  [AB]  ,  p  [BB]};  for  ternaries: 

{p  [AA] ,  p  [AB] ,  p  [AC] ,  p  [BB] ,  p  [BC] ,  p  [CC]}  and  so  on. 

—  Type.  Set  of  Nspecies (Nspecies  +  l)/2  numbers. 

—  Units.  Natural  units  of  the  $code,  e.g.,  A  or  a.u.  (Bohr)  if  the  calculations  were  performed  with  VASP  [32] 
or  QE  [33],  respectively. 

—  Example.  nbondxx=l. 2599, 1.0911, 1.0911, 1.7818, 1.2599, 1.7818  (for  a  three  species  entry). 

—  Request  syntax.  $aurl/?nbondxx. 

•  nspecies 

—  Description.  Returns  the  number  of  species  in  the  system  (e.g.,  binary  =  2,  ternary  =  3,  etc.). 

—  Type,  number. 

—  Example.  nspecies=3. 

—  Request  syntax.  $aurl/?nspecies. 

•  Pearson_symbol_orig  (Pearson_symbol_relax) 

—  Description.  Returns  the  Pearson  symbol  [49,  54]  of  the  original- unrelaxed  (relaxed)  structure  before  (after) 
the  calculation. 

—  Type,  string. 

—  Example.  Pearson_symbol_orig=mS32  (Pearson_symbol_relax=mS32). 

—  Request  syntax.  $aurl/?Pearson_symbol_orig  ($aurl/?Pearson_symbol_relax). 

•  positions_cartesian 
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—  Description.  Final  Cartesian  positions  (xi,Xj,Xk)  in  the  notation  of  the  code. 

—  Type.  Triplets  (number, number, number)  separated  by  for  each  atom  in  the  unit  cell. 

—  Units.  Natural  units  of  the  $code,  e.g.,  in  Cartesian  coordinates  (A)  if  the  calculations  were  performed 
with  VASP  [32]. 

—  Example.  positions_cartesian=0 , 0 , 0 ; 18 . 18438 ,0,2. 85027 ; . . . 

—  Request  syntax.  $aurl/?positions_cartesian. 

•  positions_f ractional 

—  Description.  Final  fractional  positions  (xi,Xj,Xk)  with  respect  to  the  unit  cell  as  specified  in  $geometry. 

—  Type.  Triplets  (number, number, number)  separated  by  f°r  each  atom  in  the  unit  cell. 

—  Units,  adimensional 

—  Example.  positions_fractional=0,0,0;0 .25,0 .25,0 .25; . . . 

—  Request  syntax.  $aurl/?positions_f ractional. 

•  pressure 

—  Description.  Returns  the  external  pressure  selected  for  the  simulation. 

—  Type,  number. 

—  Units.  Natural  units  of  the  $code,  e.g.,  kbar  or  a.u.  (Ry/Bohr)  if  the  calculations  were  performed  with 
VASP  [32]  or  QE  [33],  respectively. 

—  Example.  pressure=10 . 0. 

—  Request  syntax.  $aurl/?pressure. 

•  prototype 

—  Description.  Returns  the  AFLOW  unrelaxed  prototype  which  was  used  for  the  calculation.  The  list  can 
be  accessed  with  the  command  “aflow  — protos”  or  by  consulting  the  online  links.  The  options  are  illustrated 
in  the  AFLOW  manual  [13].  Note  that  during  the  calculation,  unstable  structures  can  deform  and  lead  to 
different  relaxed  configurations.  It  is  thus  imperative  for  the  user  to  make  an  elaborate  analysis  of  the 
final  structure  to  pinpoint  the  right  prototype  to  report.  Differences  in  Bravais  lattices,  Pearson  symbol, 
space  groups,  for  the  _orig  and  _relax  versions  are  extremely  useful  for  this  task. 

—  Type,  string. 

—  Example.  prototype=T0001 .  A2BC. 

—  Request  syntax.  $aurl/?prototype. 

•  PV_cell  (PV_atom) 

—  Description.  Pressure  multiplied  by  volume  of  the  unit  cell  (of  the  atom). 

—  Type,  number. 

—  Units.  Natural  units  of  the  $code,  e.g.,  eV  or  Ry  (eV/atom  or  Ry/atom)  if  the  calculations  were  performed 
with  VASP  [32]  or  QE  [33] ,  respectively. 

—  Example.  PV_cell=12 . 13  (PV_atom=3 . 03). 

—  Request  syntax.  $aurl/?PV_cell  ($aurl/?PV_atom). 

•  scintillation_attenuation_length 

—  Description.  Returns  the  scintillation  attenuation  length  of  the  compound  in  cm.  See  Refs.  [9,  55]. 

—  Type,  real  number. 

—  Example.  scintillation_attenuation_length=2 . 21895. 

—  Request  syntax.  $aurl/?scintillation_attenuation_length. 


•  sg  (sg2) 
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—  Description.  Evolution  of  the  space  group  of  the  compound  [47,  48].  The  first,  second  and  third  string 
represent  space  group  name/number  before  the  first,  after  the  first,  and  after  the  last  relaxation  of  the 
calculation. 

—  Tolerance,  sg  values  are  calculated  with  3.0%  and  0.5  deg  tolerances  for  lengths  and  angles,  respectively. 
(sg2  is  with  1.5%  and  0.25  deg).  Symmetry  is  cross  validated  through  the  internal  engines  of  AFLOW  [13], 
PLATON  [56],  and  FINDSYM  [57]. 

—  Type.  Triplet  string, string, string. 

—  Example.  sg=Fm-3m#225,Fm-3m#225,Fm-3m#225  (sg2=R-3c  #167,R-3c  #167,R-3c  #167). 

—  Request  syntax.  $aurl/?sg  ($aurl/?sg2). 

•  spacegroup_orig  (spacegroup_relax) 

—  Description.  Returns  the  spacegroup  number  [47]  of  the  original-unrelaxed  (relaxed)  structure  before  (after) 
the  calculation. 

—  Tolerance.  Same  as  sg. 

—  Type,  number. 

—  Example.  spacegroup_orig=225  (spacegroup_relax=225). 

—  Request  syntax.  $aurl/?spacegroup_orig  ($aurl/?spacegroup_relax). 

•  species,  species_pp,  species_pp_version 

—  Description.  Species  of  the  atoms,  pseudopotentials  species,  and  pseudopotential  versions. 

—  Type.  List  of  strings  separated  by 

—  Example.  species=Y,Zn,Zr,  species_pp=Y_sv,Zn,Zr_sv, 

species_pp_version=Y_sv : PAW_PBE : 06Sep2000 , Zn : PAW_PBE : 06Sep2000 , Zr_sv : PAW_PBE : 07Sep2000. 

—  Request  syntax.  $aurl/?species,  $aurl/?species_pp,  $aurl/?species_pp_version. 

•  spin_cell  (spin_atom) 

—  Description.  For  spin  polarized  calculations,  the  total  magnetization  of  the  cell  (magnetization  per  atom). 

—  Type,  number. 

—  Units.  Natural  units  of  the  $code,  e.g.,  /ib  (Bohr  magneton). 

—  Example.  spin_cell=2 . 16419  (spin_atom=0 . 541046). 

—  Request  syntax.  $aurl/?spin_cell  ($aurl/?spin_atom). 

•  spinD 

—  Description.  For  spin  polarized  calculations,  the  spin  decomposition  over  the  atoms  of  the  cell. 

—  Type.  List  of  numbers  separated  by  . 

—  Units.  Natural  units  of  the  $code,  e.g.,  /jlb  (Bohr  magneton). 

—  Example.  spinD=0 . 236 , 0 . 236 ,  -0 . 023 , 1 . 005. 

—  Request  syntax.  $aurl/?spinD. 

•  spinF 

—  Description.  For  spin  polarized  calculations,  the  magnetization  of  the  cell  at  the  Fermi  level. 

—  Type,  number. 

—  Units.  Natural  units  of  the  $code,  e.g.,  /jlb  (Bohr  magneton). 

—  Example.  spinF=0 . 410879. 

—  Request  syntax.  $aurl/?spinF. 

•  stoichiometry 

—  Description.  Similar  to  composition,  returns  a  comma  delimited  stoichiometry  description  of  the  structure 
entry  in  the  calculated  cell. 
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—  Type.  List  of  number  separated  by  • 

—  Example.  stoichiometry=0 . 5,0 . 25,0 . 25. 

—  Request  syntax.  $aurl/?stoichiometry. 

•  valence_cell_std  (valence_cell_iupac) 

—  Description.  Returns  standard  valence  (IUPAC  valence,  the  maximum  number  of  univalent  atoms  that 
may  combine  with  the  atoms  [58]). 

—  Type,  number. 

—  Example.  valence_cell_std=22  (valence_cell_iupac=12) 

—  Request  syntax.  $aurl/?valence_cell_std  ($aurl/?valence_cell_iupac). 

•  volume_cell  (volume_atom) 

—  Description.  Returns  the  volume  of  the  unit  cell  (per  atom  in  the  unit  cell). 

—  Type,  number. 

—  Units.  Natural  units  of  the  $code,  e.g.,  A3  or  Bohr3  (A3 /atom  or  Bohr3 /atom)  if  the  calculations  were 
performed  with  VASP  [32]  or  QE  [33] ,  respectively. 

—  Example.  volume_cell=100 . 984  (volume_atom=25 . 2461). 

—  Request  syntax.  $aurl/?volume_cell  ($aurl/?volume_atom). 


VII.  EXAMPLES 
A.  Generating  a  phase  diagram 

In  this  example  we  introduce  the  steps  to  generate  a  bi¬ 
nary  phase  diagram  at  zero  temperature.  As  an  example, 
we  choose  the  system  OsTc  [7,  20,  59],  and  we  illustrate 
the  logical  steps  for  obtaining  it.  The  user  should  prepare 
his/her  own  computer  code  to  download  and  analyze  the 
data  as  suggested. 

1.  Upon  interrogation  of  the  AFLOWLIB.org  database 
( database  searches  layer ,  see  Figures  1  and  2),  OsTc 
is  found  to  be  part  of  the  project-layer  with  AURL 
$aurl=af lowlib . duke . edu : AFL0WDATA/LIB2_RAW/. 
This  $aurl  is  translated  into  the  WEB  address  $web= 
http : //af lowlib . duke . edu/ AFL0WDATA/LIB2_RAW. 

2.  The  user  downloads  and  parses  the  query  $web/ 
Tkeywords.  Being  in  a  project-layer ,  a  better  and  faster 
alternative  is  to  download  the  entries’  number  and  type 
with  the  queries  $web/?af lowlib_entries  and  $web/ 
?af lowlib_entries_number.  The  user  then  parses 
$web/?af lowlib_entries  and  the  string  Os_pvTc_pv 
associated  with  the  requested  OsTc  phase  diagram. 

3.  The  user  downloads  and  parses  the  $web/0s_ 
pvTc_pv/  part  of  a  set-layer.  The  process  can  be 
accelerated  by  querying  $web/Os_pvTc_pv/?af lowlib_ 
entries  and  $web/Os_pvTc_pv/?af lowlib_entries_ 
number  directly,  to  find  further  $aurl.  The  results  are: 
$aflowlib_entries=l ,2,3,4, . . ,657.AB,657.BA, . ., 
$af lowlib_entries_number=260.  We  enumerate  and 
label  these  260  entries  with  Sentry^. 

4.  The  user  loops  through  $entry^,Vi  and  collects: 
$web/Os_pvTc_pv/$entry^/?stoichiometry  and 


$web/Os_pvTc_pv/$entry^/?enthalpy_f  ormation. 

5.  Finally,  the  user  collects  the  free  energies  and  plots 
the  convex  hull  as  depicted  in  Figure  4. 

6.  The  whole  process  can  be  performed  with  the 
AFLOW  code.  The  command  “aflow  —alloy  OsTc  —update 
— server=af lowlib.org”  connects  to  the  right  server,  down¬ 
loads  the  information,  calculates  the  free  energy  curve 
and  prepares  a  PDF  document  with  the  appropriate  in¬ 
formation  and  hyperlinks  to  the  individual  entries.  See 
the  AFLOW  literature  for  more  options  [13].  The  user 
still  has  to  double  check  the  final  relaxed  structure  pro¬ 
totypes.  This  is  performed  with  a  combination  of: 
$web/Os_pvTc_pv/$entryi/?compound, 
$web/Os_pvTc_pv/$entry^/?geometry, 
$web/Os_pvTc_pv/$entry^/?positions_cartesian, 
$web/Os_pvTc_pv/$entry^/?prototype, 

including  files  such  as: 

$web/Os_pvTc_pv/$entry^/edata. orig. out,  and 
$web/Os_pvTc_pv/$entry^/edata .  relax .  out, 
and  verifying  the  results  by  consulting  appropriate  pro¬ 
totype  databases  (e.g.,  the  Naval  Research  Laboratory 
Crystal  Structure  database ,  Ref.  [36]). 


B.  Obtaining  band  structures 

In  this  example,  we  introduce  the  steps  to  obtain  the 
band  structure  and  density  of  states  plots  for  a  calcu¬ 
lated  material.  As  an  example,  we  choose  the  compound 
Al2CuMn.  The  user  should  prepare  his/her  own  com¬ 
puter  code  to  download  and  analyze  the  data  as  sug¬ 
gested. 
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FIG.  4:  Phase  diagram  of  OsTc  automatically  calculated 
through  the  AFLOWLIB  API  [7,  20,  59]. 


1.  Upon  interrogation  of  the  AFLOWLIB.org 
database  ( database  searches  layer ,  see  Figures  1 
and  2),  AUCuMn  is  found  to  be  part  of  the 
project-layer  with  AURL  $aurl=af lowlib.duke . edu: 
AFL0WDATA/LIB3_RAW/.  This  $aurl  is  translated  into 
the  WEB  address  $web=http :  / /af lowlib .  duke .  edu/ 
AFL0WDATA/LIB3_RAW. 

2.  Within  the  project-layer ,  the  user  parses  the  query 
$web/?af  lowlib_entries,  which  shows  that  the  string 
AlCu_pvMn_pv  is  associated  with  the  requested  AlCuMn 
ternary  system. 

3.  The  user  parses  $web/AlCu_pvMn_pv/.  The  query 

is  part  of  a  set-layer.  In  this  case,  the  set  contains  10  en¬ 
tries ,  namely  the  calculation  for  the  AlCu_pvMn_pv  sys¬ 
tem  in  the  prototypes  ICSD_57695 . ABC,  T0001.{A2BC, 
AB2C ,  ABC2}  (Heusler  configurations) ,  T0002 .  {A2BC , 

AB2C, ABC2}  (inverse  Heusler),  and  T0003 . {ABC, BCA, 
CAB}  (half- Heusler) .  The  postfixes  A,B,C  indicate  the 
positions  of  the  species  in  the  prototype;  e.g.  A2BC  indi¬ 
cates  the  material  AUCuMn. 

4.  Using  the  ?enthalpy_f  ormation  and  ?loop  queries 
for  these  10  entries  the  user  finds  which  are  stable  and 
include  a  band  structure  calculation  (indicated  by  a  nega¬ 
tive  formation  enthalpy  and  the  string  bands  in  the  ?loop 
query  output).  For  this  example,  the  user  selects  the  en¬ 
try  $web/AlCu_pvMn_pv/T0001 .  A2BC/  for  the  compound 
AUCuMn,  which  satisfies  both  queries. 

5.  At  the  calculation-layer ,  the  user  finds  the  full 
af  lowlib.  out  entry.  By  interrogating  $web/AlCu_ 
pvMn_pv/T0001 .  A2BC/?f  iles,  the  user  obtains  a  list  of 
all  of  the  files  available  for  download  for  this  calculation, 
including: 

•  the  input  file  for  the  calculation  $web/AlCu_pvMn_ 
pv/TOOOl . A2BC/af low . in, 

•  the  spin  polarized  band  structure  with  elec¬ 
tronic  density  of  states  $web/AlCu_pvMn_pv/ 
T0001 . A2BC/AlCu_pvMn_pv . T000 1 . A2BC . png 


•  the  Brillouin  Zone  in  AFLOW  notation  [49]  AlCu_ 
pvMn.pv . T0001 . A2BC.BZ . png 

•  the  total  electronic  density  of  states  AlCu_pvMn_ 
pv . T0001 . A2BC.D0S . png 

•  the  partial  electronic  density  of  states  for  inequiv¬ 
alent  positions  “Al” -AlCu_pvMn_pv .  T000 1 .  A2BC_ 
PED0S_1_4_A1 . png,  “Cu”-AlCu_pvMn_pv . T0001 . 
A2BC_PED0S_3_4_Cu . png,  and  “Mn”-AlCu_pvMn_ 
pv . T0001 . A2BC_PED0S_4_4_Mn . png 

A  collage  of  these  files  is  shown  in  Figure  5. 

Also  available  in  the  calculation-layer  for  this  entry  are 
the  input  and  output  files  from  the  VASP  and  AFLOW 
runs,  that  the  user  may  extract  from  the  output  of  the 
$web/AlCu_pvMn_pv/T0001 .  A2BC/?f  iles  command. 


C.  Synergy  of  experimental  and  calculation  data 
on  a  rare  prototype 

The  experimental  data  on  binary  alloys  contains  many 
gaps.  It  also  presents  a  huge  panoply  of  structural  proto¬ 
types,  ranging  from  very  common  ones,  appearing  in  hun¬ 
dreds  of  compounds,  to  very  rare  ones  appearing  in  just  a 
few  systems.  HT  calculations  can  be  used  to  bridge  those 
gaps  and  provide  a  more  complete  picture  about  the  exis¬ 
tence  of  yet  unobserved  compounds  and  their  structures. 
They  can  also  considerably  extend  the  predicted  range 
of  those  rare  prototypes,  indicating  their  existence  in  a 
larger  set  of  binary  systems.  One  such  example  studied 
the  prevalence  of  the  PtgTi  prototype.  This  structure 
has  been  experimentally  observed  in  11  systems,  but  a 
high-throughput  search  over  all  of  the  binary  transition 
internet  allies  revealed  it  should  be  stable  at  low  tem¬ 
peratures  in  59  systems  [25].  The  study  verified  all  the 
experimental  occurrences  while  offering  additional  pre¬ 
dictions,  including  a  few  surprising  ones  in  supposedly 
well-characterized  systems  (e.g.,  Cu-Zn).  This  example 
serves  as  a  striking  demonstration  of  the  power  of  the 
high-throughput  approach.  In  this  section  we  present 
a  new  example,  discussing  recent  reports  observing  the 
rare  prototype  Pd4Pu3  in  a  few  transition  metal  binaries 
and  computationally  predicting  a  considerable  extension 
of  its  stability  or  metastability  in  such  systems. 

The  Pd4Pu3  (hR14,  space  group  #148)  was  first  ob¬ 
served  in  its  eponymous  system  in  1967  [61,  62].  It 
has  since  been  reported  in  37  additional  binary  systems, 
mostly  of  a  lanthanide  or  an  actinide  with  the  elements 
Pt  or  Pd.  [63].  Only  6  compounds  of  this  prototype  have 
been  reported  in  transition  metal  binary  systems:  N4TU 
[64,  65],  Pd4Y3  [66],  Pd4Zr3  [67],  Pt4Zr3  [68],  Rh4Zr3 
[69],  and  most  recently  Hf3Pt4  [70].  In  these  compounds, 
one  component  is  a  3B  or  4B  element  and  the  other  is 
from  the  ninth  or  tenth  column  of  the  periodic  table.  In 
this  example  we  wish  to  examine  the  possible  appearance 
of  this  prototype  in  all  transition  metal  binary  systems 
of  these  columns  (30  systems) .  This  can  be  done  in  a  few 
steps,  as  follows: 
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FIG.  5:  Band  structure,  total  and  partial  electronic  densities  of  states  for  A^CuMn,  as  available  through  the  AFLOWLIB  API. 
The  partial  density  of  states  is  calculated  only  for  inequivalent  atomic  positions  [compound  AUID=aflow:83cd2da0257e8def]. 


1.  Consulting  the  complete  list  of  structure  designa¬ 
tions  of  AFLOW  with  the  command  “aflow  —  protos”  or 
by  the  online  links,  the  user  finds  the  label  of  the  proto¬ 
type,  in  this  case  655 .  AB  or  655 .  BA  (depending  on  the 
order  of  the  species). 

2.  Using  $aurl=af lowlib . duke . edu : AFLOWDATA/ 
LIB2_RAW/?af lowlib_entries  the  user  finds  the  en¬ 
try  name  for  each  of  those  30  systems  in  the  set-layer. 
Then,  using  $aurl=af lowlib .duke . edu: AFLOWDATA/ 
LIB2_RAW/XXX/?af lowlib_entries  for  each  of  those 
names  the  user  finds  the  calculations  of  the  desired  pro- 


Co 


Rh 


Ni 


Pt 


Pd 


m 


mi 


Sc 


Zr 


Hf 


Ti 


M 

f  !!  !QQ  _ 

m 


FIG.  6:  Pettifor-type  structure  map  of  the  Pd4Pu3  phase 
in  transition  metal  binary  systems.  Both  axes  are  labeled 
by  increasing  Mendeleev  number  after  Pettifor  [60].  Colors 
denote  reported  compounds  with  indicated  year  of  discov¬ 
ery  (green  and  light  blue)  and  prediction  of  unreported  com¬ 
pounds  (red  and  orange)  found  to  be  stable  (green  and  red)  or 
metastable  (light  blue  and  orange)  in  the  calculations.  Sys¬ 
tems  where  the  structure  is  unstable  (formation  enthalpy  of 
more  than  30  meV/atom  above  the  convex  hull)  are  denoted 
in  blue.  The  square  parentheses  denote  the  formation  en¬ 
thalpy  of  metastable  and  unstable  structures  above  the  con¬ 
vex  hull  of  the  respective  system. 


totype  in  the  calculait on-layer  (indicated  by  the  string 
655. BA  or  655. BA  in  the  query  output). 


3.  Following  the  steps  of  example  VII A  the  user  con¬ 
structs  the  convex  hull  for  each  of  these  systems  and  finds 
the  position  of  the  desired  structure  in  it,  as  either  stable, 
metastable  or  unstable. 


Following  these  steps  for  the  Pd4Pu3  prototype  we 
find  that  it  appears  as  a  low  temperature  stable  com¬ 
pound  in  six  systems,  two  reported  in  experiments  and 
four  newly  predicted  ones.  The  structure  is  also  found 
to  be  metastable  (with  less  than  30meV/atom  above  the 
respective  system  convex  hull)  in  ten  systems,  of  which  it 
was  reported  in  four  by  experiments,  and  is  predicted  in 
six  additional  ones.  Among  the  predicted  phases,  three 
compounds  of  the  same  stoichiometry,  Pt4Y3,  Hf3Pd4 
and  Hf3Rh4,  are  reported  with  an  unknown  structure 
in  the  experimental  literature  but  identified  with  the 
Pd4Pu3  structure  in  the  calculations.  Overall,  the  cal¬ 
culation  extends  the  prevalence  of  this  prototype  (stable 
or  metastable)  among  transition  metal  binaries  from  five 
systems  to  sixteen.  Figure  6  summarizes  these  results. 
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D.  Bash  api.sh  example 


This  “bash”  script  example  api.sh  downloads  an  aflowlib . out  entry  for  the  project -,  set-,  or  calculation-layers 
of  the  binary  alloys. 


# ! /bin/bash 

SERVER= J  http : // aflowlib . duke . edu ; 

PRO JECT=  ’  AFL0WDATA/LIB2_RAW/ ’ 

#URL= $  SERVER ’ / ’  $PR0 JECT 

URL= $ SERVERS / ’  $PR0 JECT ’ Os_pvTc_pv/ ’ 

#URL= $  SERVER ’ / ’ $PR0 JECT ’ /0s_pvTc_pv/657 . AB/ ’ 
IFS=>,>; 

for  key  in  $(wget  -q  -0  -  ${URL}?keywords) ; 
do 

val=$(wget  -q  -0  -  ${URL}?${key}) ; 
echo  "${key}=${val}" ; 

done 


#  run  sh  ./api.sh 

#  server  name 

#  project  name 

#  project-layer 

#  set-layer 

#  calculation-layer 

#  options 

#  get  all  keywords 

#  loop  keywords 

#  assign  one  keyword 

#  print  keyword 

#  end  loop 


E.  Python  api.py  example 


This  “python3”  script  example  api  .py  downloads  an  aflowlib.  out  entry  for  the  project -,  set-,  or  calculation-layers 
of  the  Heusler  alloys  database. 


# ! /usr/bin/python3 
import  j  son 

from  urllib . request  import  urlopen 
SERVERS  http : //aflowlib . duke . edu; 

PRO JECT=  ’  AFL0WDATA/LIB3_RAW/ ’ 

#URL= S  ER VER+ ’ / ’  +PR0 JECT 

#URL= S  ER VER+ ’ / ; +PR0JECT+ J  AlCu_pvMn_pv/ ’ 

URL=SERVER+ ’ / ’ +PR0JECT+ ’  AlCu_pvMn_pv/T0001 . A2BC/ ’ 

entry= j  son . loads (urlopen (URL+ ; ?f  ormat= j son ; ) . readall ( ) . decode ( ; utf -8 ; ) ) 
for  key  in  entry: 

print (  "{}={}". format (key ,  entry  [key])  ) 


#  python3 

#  preamble 

#  preamble 

#  server  name 

#  project  name 

#  project-layer 

#  set-layer 

#  calculation-layer 

#  load 

#  loop  keys 

#  print  key 


VIII.  UPDATES  BEYOND  VERSION  1.0 

Standards,  like  databases,  are  as  good  as  the  updates 
they  receive  when  new  quantities  and  descriptors  become 
available.  The  list  of  keywords  available  in  the  current 
version  of  the  standard  is  far  from  being  complete  for  ra¬ 
tional  materials  design.  The  user  is  invited  to  search  and 
consult  appropriate  API  specifications  addenda,  which 
will  be  published  periodically  through  the  consortium 
website  AFLOWLIB.org.  The  entries’  API  version  can 
be  found  by  inquiring  the  keyword  $aurl/?aapi  as  de¬ 
scribed  above. 


IX.  CONCLUSION 

The  AFLOWLIB  API  provides  a  simple  and  powerful 
tool  for  accessing  a  large  set  of  simulated  materials  prop¬ 
erties  data.  This  will  allow  the  community  to  make  use  of 
AFLOWLIB  to  the  fullest  extent  possible,  through  search 


formats  allowing  complete  accessibility  of  the  database 
contents  at  all  levels  and  integration  of  search  results  into 
externally  formulated  workflows.  Such  workflows  may 
execute  any  type  of  investigation  on  the  obtained  data, 
ranging  from  a  simple  study  of  the  properties  of  a  specific 
material  to  extensive  statistical  analyses  of  whole  struc¬ 
ture  classes  for  materials  prediction.  The  full  provenance 
of  the  data  produced  is  provided,  following  a  standard  of 
reproducible  and  transparent  scientific  data  sharing,  to 
facilitate  its  straightforward  reproduction  and  extension. 

The  AFLOWLIB  database  is  growing  continually  by 
updating  existent  alloy  libraries  and  adding  new  ones 
(e.g.,  recent  attention  is  focused  on  ternary  systems  and 
electronic  properties).  The  new  API  described  in  this  pa¬ 
per  is  built  on  top  of  the  AFLOW  framework,  developed 
to  create  the  database  and  to  interrogate  it,  but  it  can  be 
easily  extended  to  other  materials  design  environments. 
It  is  constructed  as  a  federatable  tool  to  maximize  the 
utility  of  the  database  to  the  scientific  community  and 
expedite  scientific  collaboration  with  particular  empha¬ 
sis  on  reproducibility,  accessibility  and  attribution. 
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